Lifelong Reinforcement Learning Timeline - Concepedia

Concept

lifelong reinforcement learning

Parents

Lifelong Learning

Children

Compositionality, Modularity, Multi-task Learning, Reinforcement Learning (Computer Engineering), Reinforcement Learning (Educational Psychology)

1.4K Publications

86.9K Citations

3.9K Authors

943 Institutions

Continual Reinforcement Learning Transfer

2013 - 2019

During this era, reinforcement learning research increasingly focused on learning across sequences of tasks, combining modular and hierarchical architectures, expert gating, and progressive task curricula. These approaches supported knowledge transfer and reduced catastrophic forgetting as agents encountered varied environments and objectives, with emphasis on memory-efficient replay strategies, task-aware memory management, and structured knowledge reuse across tasks. Exploration and representation learning were improved through intrinsic motivation, stochastic perturbations, and entropy-regularized objectives, which encouraged robust and diverse behaviors in progressively richer environments, often with cross-domain challenges. Researchers adopted scalable architectures such as modular networks, networks of experts, and differentiable planning to enable transfer and growth, while successor-feature formalisms helped policies generalize across domains.
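One of the replay strategies from this era, prioritized sampling [1], can be illustrated with a minimal sketch. It assumes the proportional variant, where transition i is drawn with probability P(i) = p_i^alpha / sum_k p_k^alpha and reweighted by w_i = (N * P(i))^(-beta), normalized by the largest weight. The function names, the default values of `alpha`, `beta`, and `eps`, and the flat-list buffer are illustrative assumptions, not the reference implementation.

```python
import random


def priorities_from_td_errors(td_errors, eps=1e-6):
    """Priority p_i = |TD error| + eps, so every transition keeps a nonzero chance."""
    return [abs(err) + eps for err in td_errors]


def sampling_probabilities(priorities, alpha=0.6):
    """P(i) = p_i^alpha / sum_k p_k^alpha; alpha = 0 recovers uniform sampling."""
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    return [s / total for s in scaled]


def importance_weights(probs, beta=0.4):
    """w_i = (N * P(i))^(-beta), divided by the largest weight so all w_i <= 1."""
    n = len(probs)
    raw = [(n * p) ** (-beta) for p in probs]
    top = max(raw)
    return [w / top for w in raw]


def sample_batch(transitions, priorities, batch_size, alpha=0.6, beta=0.4, rng=None):
    """Draw a batch (with replacement) and return (index, transition, weight) triples.

    Assumes strictly positive priorities, e.g. from priorities_from_td_errors.
    """
    rng = rng or random.Random()
    probs = sampling_probabilities(priorities, alpha)
    weights = importance_weights(probs, beta)
    indices = rng.choices(range(len(transitions)), weights=probs, k=batch_size)
    return [(i, transitions[i], weights[i]) for i in indices]
```

The flat list keeps the sketch short; a practical buffer would typically use a tree structure so that sampling and priority updates do not scan every stored transition.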

• Continual/lifelong RL builds capabilities across sequential tasks through modular networks, expert gating, and curriculum-style task progression, so that knowledge transfers between tasks and forgetting is mitigated [6], [5], [10], [19].

• Memory usage in RL is optimized via prioritized sampling [1], curated replay databases [20], and hierarchical replay [16], improving sample efficiency and knowledge reuse across tasks [17].

• Exploration enhancements combine intrinsic motivation [2], stochastic weight perturbations [18], and entropy-based objectives [4] to drive diverse behaviors and more reliable learning in RL, while rich environments [11] highlight the challenges posed by the environment itself.

• Hierarchical and modular architectures enable scalable transfer across tasks via temporal abstraction and planning modules [2], progressive networks [6], network-of-experts [5], and differentiable planning [9], with multi-domain dialogue [8] illustrating cross-domain application.

• Cross-domain transfer is formalized with successor features and generalized policy improvement [13], zero-shot transfer from task features [3], and cross-domain lifelong transfer RL [19], with hierarchical replay supporting transfer [16].
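The successor-feature transfer in the last bullet [13] can be sketched as follows. The sketch assumes rewards are linear in some features, so that the value of a previously learned policy pi_i on a new task with weight vector w is Q_i(s, a) = psi_i(s, a) · w, and generalized policy improvement picks the action maximizing max_i Q_i(s, a). The function names and the nested-list layout of the successor features are illustrative assumptions.

```python
def q_value(psi_sa, w):
    """Q(s, a) = psi(s, a) . w for one policy's successor features."""
    return sum(f * wi for f, wi in zip(psi_sa, w))


def gpi_action(successor_features, w):
    """Generalized policy improvement at a single state.

    successor_features[i][a] holds psi_i(s, a), the successor-feature vector
    of stored policy i for action a in the current state s (hypothetical layout).
    Returns the chosen action and its value max_i psi_i(s, a) . w.
    """
    n_actions = len(successor_features[0])
    best_action, best_value = None, float("-inf")
    for a in range(n_actions):
        # Evaluate action a under every stored policy and keep the best estimate.
        value = max(q_value(psi[a], w) for psi in successor_features)
        if value > best_value:
            best_action, best_value = a, value
    return best_action, best_value


# Two stored policies, two actions, two reward features (made-up numbers).
psi_policy_1 = [[1.0, 0.0], [0.0, 1.0]]
psi_policy_2 = [[0.5, 0.5], [0.0, 2.0]]
stored = [psi_policy_1, psi_policy_2]

# A new task is specified only by its reward weights w.
action_a, value_a = gpi_action(stored, w=[1.0, 0.0])
action_b, value_b = gpi_action(stored, w=[0.0, 1.0])
```

Changing only `w` changes which stored policy's estimate wins for each action, which is how this formulation reuses old policies on a new task without relearning their successor features.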

Popular Keywords

artificial intelligence

machine learning

deep reinforcement learning

[1]

Prioritized Experience Replay

2015 • artificial intelligence, deep reinforcement learning, exploration v exploitation, intelligent systems, machine learning, reinforcement learning (educational psychology), robot learning, sequential decision making

[2]

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation

2016 • artificial intelligence, deep reinforcement learning, machine learning, reinforcement learning (educational psychology), robot learning, sequential decision making

[3]

Using task features for zero-shot knowledge transfer in lifelong learning

2016 • artificial intelligence, intelligent systems, learning control, machine learning, robot learning

[4]

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

2018 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (educational psychology), sequential decision making

[5]

Expert Gate: Lifelong Learning with a Network of Experts

2017 • artificial intelligence, machine learning

[6]

Progressive Neural Networks

2016 • artificial intelligence, learning control, machine learning, robot learning

[7]

Count-Based Exploration with Neural Density Models

2017 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology), robot learning, sequential decision making

[8]

Deep Reinforcement Learning for Multi-Domain Dialogue Systems

2016 • artificial intelligence, deep reinforcement learning, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology)

[9]

Value Iteration Networks

2017 • artificial intelligence, deep reinforcement learning, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology), robot learning, sequential decision making

[10]

Active Task Selection for Lifelong Machine Learning

2013 • artificial intelligence, intelligent systems, machine learning, robot learning

[11]

Emergence of Locomotion Behaviours in Rich Environments

2017 • artificial intelligence, deep reinforcement learning, intelligent systems, reinforcement learning (educational psychology), robot learning

[12]

Weighted importance sampling for off-policy learning with linear function approximation

2014 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology), sequential decision making

[13]

Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement

2019 • artificial intelligence, deep reinforcement learning, intelligent systems, machine learning, reinforcement learning (educational psychology), robot learning, sequential decision making

[14]

A Study on Overfitting in Deep Reinforcement Learning

2018 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology)

[15]

Deep Q-learning From Demonstrations

2018 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology)

[16]

Knowledge Transfer for Deep Reinforcement Learning with Hierarchical Experience Replay

2017 • artificial intelligence, deep reinforcement learning, machine learning, robot learning

[17]

A Deeper Look at Experience Replay

2017 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (educational psychology), robot learning, sequential decision making

[18]

Noisy Networks for Exploration

2017 • artificial intelligence, deep reinforcement learning, exploration v exploitation, machine learning, reinforcement learning (computer engineering), reinforcement learning (educational psychology), robot learning

[19]

Autonomous cross-domain knowledge transfer in lifelong policy gradient reinforcement learning

2015 • artificial intelligence, intelligent systems, learning control, machine learning, robot learning

[20]

The importance of experience replay database composition in deep reinforcement learning

2015 • artificial intelligence, deep reinforcement learning, intelligent systems, machine learning, reinforcement learning (educational psychology), sequential decision making

Continual Lifelong Reinforcement Learning

2020 - 2023